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An important task required to build a scalable, fault-tolerant quantum computer is to efficiently 
represent an arbitrary single-qubit rotation by fault-tolerant quantum operations. Traditionally, 
the method for decomposing a single-qubit unitary into a discrete set of gates is Solovay-Kitaev 
decomposition, which in practice produces a sequence of depth 0(log'^(l/e)), where c ~ 3.97 is the 
state-of-the-art. The proven lower bound is c = 1, however an efficient algorithm that saturates 
this bound is unknown. In this paper, we present an alternative to Solovay-Kitaev decomposition 
employing state distillation techniques which reduces c to between 1.12 and 2.27, depending on 
the setting. For a given single-qubit rotation, our protocol significantly lowers the length of the 
approximating sequence and the number of required resource states (ancillary qubits). In addition, 
our protocol is robust to noise in the resource states. 
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Given recent progress in quantum algorithms, quan- 
tum error correction, and quantum hardware, a scalable 
quantum computer is becoming closer and closer to re- 
ality. For many proposed quantum computer architec- 
tures, e.g., topological systems based on the braiding of 
non-Abelian anyons [IH5] or the surface code model based 
on code deformation [51 [7] , so-called Clifford operations 
and stabilizer state preparations or measurements can be 
implemented efhciently and accurately. However, these 
operations alone are not sufficient for quantum univer- 
sality since they can be simulated classically [HHTU] . One 
technique to achieve quantum universality is to use magic 
state distillation [TTHT3] to augment the set with a single 
non-Chfford operation, e.g., the single-qubit tt/8 gate, 
T. This augmented set can be used to approximate any 
single-qubit unitary using Solovay-Kitaev decomposition 

The Solovay-Kitaev theorem [Kj states that for any e 
and single-qubit gate U, U can be approximated to pre- 
cision e using 0(log'^(l/e)) gates drawn from a universal, 
discrete gate set, where c is a small constant. State- 
of-the-art implementations of Solovay-Kitaev decompo- 
sition result in c ^ 3.97 [HJ [TC], resulting in an average 
decomposition sequence with hundreds to thousands of 
T gates [IS] . Each T gate requires a number of copies of 
a quantum magic state (Hadamard -|-l-eigenstate), 
where the number depends on the specific state distilla- 
tion protocol and purity of the state [TTl - fTS] . Therefore, 
it is especially important to minimize the number of T 
gates when decomposing a rotation, since each T gate 
requires additional ancillary qubits for implementation. 

In this paper, we present an alternative protocol to 
Solovay-Kitaev decomposition that allows the implemen- 
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tation of single-qubit rotations through the use of dis- 
tilled magic l-ff) states. We show that the resources re- 
quired by our protocol are substantially fewer than the 
resources required by state-of-the-art implementations of 
Solovay-Kitaev decomposition 16 , in both the number 
of gates and the number of quantum magic states neces- 
sary to apply an arbitrary single-qubit unitary. 



I. MAGIC STATE DISTILLATION AND 
IMPLEMENTATION OF ROTATIONS 

Solovay-Kitaev decomposition [TH - fTC] enables the ap- 
proximation of any gate using an approximately universal 
set of elementary gates, e.g., {H,T,A{X)}, where A{X) 
denotes the controlled-NOT gate. In particular, one can 
approximate any single-qubit unitary operation using the 
set {H,T}. Magic state distillation is then used to pro- 
duce the magic \H) states necessary to implement the T 
gate. 

We call a state |^) magic if given n noisy copies of {tp) 
and the ability to perform perfect Clifford operations, we 
can obtain, or "distill" , a purer copy of {ip) from a Clifford 
circuit applied to the n noisy copies of {ip). We can then 
obtain even purer states which are arbitrarily close to 
the perfect state by applying the protocol recursively [TTl - 
[T5] . These distilled states can be used to implement non- 
Clifford operations, e.g., the T gate, as described below. 

We briefly review how to perform an arbitrary rotation 
about the Z-axis using a resource state, and how to ap- 
ply the T gate. We assume that Clifford operations are 
applied perfectly, since they can be implemented fault- 
tolerantly, and that the resource states are very close to 
pure. One can use the protocols of [TTJ [12] to perform 
the initial distillation in order to obtain arbitrarily pure 
resource states for input. We focus on the -1-1 eigenstate 
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FIG. 1. Circuit randomly implementing a rotation of angle 
±9 around the Z(X)-axis. 



of the Hadamard operation, H, 

|i/>=cos^|0)+sin||l), 

and omit the cost of the initial distillation of the \H) 
state. We concentrate on single-qubit states found in ei- 
ther the XZ- or XF-plane of the Bloch sphere. Note that 
one can easily move a state from one plane to the other 
by the application of the Clifford HSHX operation. 
Suppose we start with states \Z(p)) and \'4)): 



\z{9)) = |o; 

\^)=a\ 



e'^ll) 
+ h\l) 



The circuit to implement a rotation around the Z-axis, 
presented in Fig. [l] leads to the two-qubit state 

\Z{e))\'iP) = a|00) +6|01) +ae'''|10) +fee''''|ll) 

a |00) + b 111) + ac'^ |10) + be'" |01) . 

Upon measurement of the first qubit in the computa- 
tional basis, we obtain 
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4a|0)-f foe'^ll), 

>ae"'|0)+6|l) = a|0) +6c^''' |1) , 



each with probability 1/2. Thus, the angle of rotation is 
chosen at random to be 9 or —6, up to global phase. An 
analogous circuit performs a rotation about the X-axis. 
Similar circuits can be found in [7 . 

As an important example of this procedure, consider 
the Xy-plane version of the \H) state: 

|Z(7r/4)) = HSHX \H) = |0) + c*^/" |1) . 

Using the circuit in Figure [l] we can implement a Z- 
rotation of angle ±7r/4, producing at random either the 
T gate or its adjoint, T^. In this particular case, we 
can deterministically apply the desired gate T or by 
applying the phase gate S, since ST^ — T. In general, 
however, this deterministic correction will not be possi- 
ble. 



FIG. 2. Two-qubit circuit used to obtain new \Hi) states from 
initial resource states \Ho}. Upon measuring the outcome, 
the output state is \Hi+i). Upon measuring the 1 outcome, 
the output state is \Hi-i). 



any single-qubit rotation. We assume that we are pro- 
vided with perfect copies of \H). We would like to mini- 
mize the number of \H) states required to implement an 
arbitrary single-qubit rotation, since these distilled states 
can be costly to produce. 

Consider the circuit of Fig. [2j One can easily verify 
that it measures the parity of the two input qubits and 
decodes the resulting state into the second qubit. We 
begin by considering the two inputs to be \H) states. We 
define = | and \H) = \Ho) = cose'olO) +sin6io|l). 
The circuit begins as: 
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Upon measurement of the first qubit, we have 
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We define di such that 
cos(9i |0) sin 6*1 |1) = 



cos 6*0 |0) -I- sin 6*0 |1) 



cos"* 



sin-* 9o 



from which we deduce cot 9i — cot^ 9q. We define \Hi) = 
cos^i |0) -f sin6'i |1), a non-stabilizer state obtained from 
\H) states, Clifford operations, and measurements. If 
the outcome of the measurement is 1, then we obtain a 
stabilizer state and discard the output (see Fig. [2]). 

The two measurement outcomes occur with respective 
probabilities 
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II. NEW STATES FROM \H) STATES 

In this section, we show that we can use a very simple 
two-qubit Clifford circuit to obtain other non-stabilizer 
states using only \H) states as an initial resource, and 
then show that these states enable the approximation of 



We now recurse on this protocol using the non- 
stabilizer states produced by the previous round of the 
protocol as part of the input to the circuit of Fig. [2] We 
define 



\Hi) = cos 61, |0) -I- sin 6*, |1) 
cot 6*4 = coU+^ 6*0. 
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FIG. 3. Process of obtaining other non-stabilizer states from 
initial \H) states. A copy of \Hi) and \Ho) probabilistically 
yield a copy of \Hi-i) or \Hi+i) using the circuit of Fig. [2] 
Each step along the ladder costs one copy of |-ffo), except the 
first one which costs two. 



If we use as input a copy of \Hi) and a copy of \Hq), we 
have 

\Ho)\H,) = cos6locos6', |00) +sine'osin6',; 111) 

+ sin 6*0 cos 9i \ 10) + cos sin 6i |01) , 

cos 00 cos 0, |00) + sin 6*0 sin 9, |01) 
+ sin 6*0 cos 9i |10) + cos 6*0 sin di |11) . 

Upon measurement of the first qubit, we have 

(cos6''|0) + sin 61' 1 1)), 
(cos0"|O)+sinr |1)), 

where 

cot 9' ~ cot 9.1 cot 9q — cot*+^ 9q = cot 9i+i, 
cot 6*" — cot 9i tan 6*0 = cot* 9q ~ cot 9i_i. 

Thus, if we measure 0, we obtain the state |iJi+i) and if 
we measure 1, we obtain the state \Hi^i). The probabil- 
ity of measuring is given by 

Po,i = cos^ 9i cos^ 9q + sin^ 9i sin^ 9o. 

Note that | < po,» < cos^ f = 0.853.... We can 
view this recursive process as a semi-infinite random walk 
with biased non-homogeneous probabilities, as Fig. [3] il- 
lustrates. Every time a step is taken along this "ladder" 
of states, one copy of \H) is consumed, except at the first 
node of the ladder (the production of state |-ffo)) when 
we require two copies of \H): if the outcome 1 is mea- 
sured at the first node, we discard the output and start 
with two new copies of \H). 

Table U lists the rotations obtained from the first few i 
recursions, using the \Hi) states, and Figs. |4] [5] illustrate. 
Note that there is a factor of two difference between the 
angle 9i involved in the description of the state and the 
rotation applied, e.g., the \Hq) state is over 6*0 = |, and 
can be used to implement a j rotation. Also, as < 
9i < J (Vi), the discontinuity of the cotangent is never a 
problem. 



III. NUMERICAL STUDY OF RANDOM 
^-ROTATIONS 

In this section, we numerically study the cost of imple- 
menting single-qubit rotations. Although the circuit in 
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TABLE I. Rotation by angle 20i implementable using an 
\Hi) = cos^i |0) +sin^i |1) state. 




FIG. 4. Red dots: Rotation by angle 20i implementable using 
an \Hi) = cos6'i |Q) -|- sin^i |1) state. 

Fig. [1] randomly applies 9 or —9, we show that our pro- 
tocol still results in an efficient application of the desired 
Z-rotation. 

Assume that we have the ideal case, where 9i ~ 26'j_|_i. 
In this hypothetical scenario, if we try to apply some 
rotation using \Hi) and it fails, then we can correct the 
gate by applying a rotation using \Hi_i). If this gate 
also fails, then we follow with \Hi^2), and so on. There 
are two crucial facts to point out. First, the probability 
of failing n times in a row scales as 1/2", i.e., it decays 
exponentially with n, such that the expected number of 
iterations is well-behaved. Second, and very importantly, 
if a T gate fails (using a |fl"o) state), we can always correct 
deterministically by applying the phase gate S (which is 
a Z-rotation by e"/^). 

Unfortunately, 9i ^ 20^+1 ; however, this assumption is 
not too far from the truth (see Fig. [5|. Since we only 
require that a gate be approximated to a given precision, 
we can actually apply any Z-rotation rapidly and with 
good accuracy. First, apply the |i/i)-rotation that gets 
you closest to your target angle and then recurse. The 
next gate to apply will depend on whether the current 
gate succeeded or not. Due to the property discussed in 
the previous paragraph, we will rapidly converge towards 
our target angle. 

We now present simulation results of obtaining the Z- 
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FIG. 5. Dots: States obtainable by recursively using the cir- 
cuit of Fig. [2] and only \H) states as initial input. Full line: 
exponential decay fit, 6*, ~ 2.4f"°-***^\ 



rotation Z((f)), where is chosen randomly, in order to 
characterize the efficiency of the protocol proposed in 
previous sections. The simulation proceeds as follows: 

1. Set desired accuracy e. 

2. Randomly pick a target rotation angle < (p < 2tt. 

3. Find the state \Hi) such that 29i is close to (f>. 

4. Simulate an instance of the ladder to obtain that 
state and add its cost to the offiine cost. 

5. Apply a rotation using the \Hi) state and the circuit 
of Fig. [TJand add one to the online cost. 

6. Recurse on steps 3 through 5 until the desired ac- 
curacy is reached. 

We define the accuracy of the applied rotation V com- 
pared to the target rotation U as 

maxD{U U\ V {^\ V''), 

where 

D{p,a) = itr (^^J{p-<t)Hp-<j)^ 

is the trace distance between states p and cr. If U and V 
are rotations about the same axis, one can show that for 
small angles of rotation, which will always be our case, 
this reduces to the difference of rotation angles, e = A^. 
In [in], the distance measure used is 



DiU,V) 



2- |tr(C/yt)| 



In the case of rotations about the same axis 
reduced to ^/T 



it can be 
A(/)/\/2forsmaU Ac/). This 



|cos(A^)| 

conversion between the two measures is important since 
we later compare performance. 

To compare the resource cost of our protocol to 
Solovay-Kitaev decomposition, we define an online and 
offline cost to apply a unitary gate. The online cost, 
Con, is the expected number of non-Clifford gates, or 
\Hi) states, required to implement the unitary gate on a 
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FIG. 6. Target accuracies are chosen such that 10~^^ < e < 
lO""* and the sample size is ~ 1.8 x 10''. The clouds of points 
are used to fit the data according to a linear fit. We obtain 
c ~ 1.29 and c' ~ 2.27. 



qubit. The offiine cost, Cog, is the total number of dis- 
tilled \H) states required to obtain all of the intermediate 
\Hi) states used to perform the given unitary operation, 
that is, the sum of the \H) states used for each ladder pro- 
cess. In our resource costs, we do not include the initial 
cost to distill \H) states. For Solovay-Kitaev decomposi- 
tion, the offline cost is always and the online cost is the 
total number of T and gates in the decomposition. 

We ran the simulation for target accuracies ranging 
between 10~^^ < e < 10~*, each time considering a new 
random angle to produce a sample of ^ 1.8 x 10"* instances 
of this protocol. Just like in the case of Solovay-Kitaev 
decomposition, we suppose that 

1, 



Con - In^(-), 
e 

Coff^ln^'(i), 

where Con and CoS are the online and offline costs, re- 
spectively, such that 

In Con ^ clnln(-), 

InCoff- c' Inln(^). 

The results are given in Fig. |6] Fits are also presented 
from which we deduce that c ^ 1.29 and c' ^ 2.27 for 
our protocol. 

As discussed in Section |V) both of these scalings repre- 
sent a significant improvement over the best implementa- 
tion, to our knowledge, of Solovay-Kitaev decomposition 
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which was itself a significant improvement over the 
previous implementation of {14i . 

Note that one can implement any single-qubit unitary 
U using three rotations around the X- and Z-axes |10) : 

U cx X{a)Z{/3)X{-f), 

for some angles a, (3,^. We have explicitly shown sim- 
ulaton results for Z-rotations, however X-rotations can 
be obtained at the same cost using the X-rotation cir- 
cuit given in Fig. [T] Thus we can use our protocol to 
produce each of the three rotations, and produce any de- 
sired single-qubit unitary operation. 



A. Other states 

To further reduce the resource costs and their respec- 
tive scalings, we show that we can use different Clifford 
circuits to produce new non-Clifford states that can be 
used as initial resources for the previously presented pro- 
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and 



tocols. We first introduce three new states 

discuss how to combine them into the described protocol. 

Consider the circuit of Fig. [7] It is a Clifford circuit to 
which we input four copies of | H) . The measurement out- 
come 000 occurs with probability 3(2 + \/2)/32 w 0.320, 
otherwise the output is discarded. If the measurement 
yields result 000, then the produced state is 



=cos0°|O)+sin(/.^ 

-3^2 



4>l 





0.446. 



Since the probability of success is 0.320 and that every 
trial consumes four copies of the average cost to pro- 
duce |V'o) is 12.50 \H) states. This circuit was designed 
to measure the stabilizer code presented in Table [TTl An- 
other interesting state can be obtained from the same cir- 
cuit, substituting one of the input states by a |-|-) state as 
is illustrated by Fig. [Sj The measurement outcome 000 
is obtained with probability (6 + V^)/32 w 0.232. The 
corresponding output state is 

= cos (/.J |0)+ sinful |1), 
2^2 



TT 
2 



cot 



3-^72, 



0.570. 



Since the probability of success is 0.232 and every trial 
consumes three copies oi\H), the average cost to produce 
l^o) is 12.95 \H) states. Fig. [o] presents another useful 
circuit. The measurement outcome 000 is obtained with 
probability 11/32 w 0.344. The corresponding output 
state is 
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FIG. 7. Circuit to produce j-^Q) states. The probability of 
success is 0.320 and every trial consumes four copies of \H} 
such that the average cost is 12.50 to produce a copy of |V'o)- 



s 


± 





1 


2 


3 


So 




X 


z 


X 




Sl 


4- 




X 


z 


X 


S2 


-f 


X 




X 


Z 


z 


-f 


z 


z 


z 


Z 



TABLE II. The stabihzer code decoded by the circuit of Fig.[7| 



The probability of success is 0.344 and every trial con- 
sumes four copies of \H) such that the ave rage cost to 

presents the 
that are de- 
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produce li^o) is 11.64 \H) states. Table 
stabilizer code in terms of its generators S 
coded by the circuit. 

We will use these states as input states to the circuit 
given in Fig. [2j where one of these states is used in place 
of the \Ho) input state. We start with a copy of \i{jq) and 
a copy of l-ffo)- If measurement outcome 1 is obtained, 
the state is discarded. Otherwise, we obtain 

IV'i) =cos01|O) -f sin^lll), 
cot = cot cot 6*0 . 

Similarly to the \Hi) states, we define 



i'ij = cos(j)l |0) -|-sin0^ |1) , 
cot 0f — cot (l>Q cot* Oq. 

If we input a copy of 4'i ) ^-^id a copy of |-ffo); we obtain 
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FIG. 8. Circuit to produce \tp})) states. The probability of 
success is 0.232 and every trial consumes four copies of \H) 
such that the average cost is 12.95 to produce a copy of \'ipo)- 
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FIG. 9. Circuit to produce states. The probability of 

success is 0.344 and every trial consumes four copies of \H) 
such that the average cost is 11.64 to produce a copy of |i/'o)- 
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TABLE III. The stabilizer code decoded by the circuit of 
Fig.[9l 



such that the output state obtained is, depending on mea- 
surement outcome, 



New "ladders" of states can be obtained using the 
tpQ^^'"^) states as inputs in place of the \Hq) states. 

Fig. [To] shows the four ladders. Table |IV| lists the rota- 
tions obtained from the first few i recursions and Fig 11 
illustrates. We see that the set of possible rotations is 
more dense. We reproduced the numerical experiment 
of Section [III] with basic offline costs of 12.50, 12.95 and 
11.64 for yPo), \'^h)^ ^^"^ |V'o)j respectively. The results 
are presented in Fig. 13 Since the set of states is denser, 
we expected improved scalings for both the online and 
offline costs. This is indeed the case, we find c ^ 1.12 



and 



1.75. 



However, the basic offline costs of our new states 
|'0o) ^-re significantly higher; for precision ~ 10~^, even 
though the online cost is smaller using the new states, 
the offline cost is still smaller if we restrict ourselves to 
the simpler scheme using only \H) states. For the proto- 
col using the new input states to reduce both the online 
and offline costs, we need to cons ider precisions smaller 
then e « 1.28 x lO'^, see Fig.]l2[ 

The reason why we do not consider other measurement 
outcomes in the circuits considered in this section is that 
in general, potential errors on the \H) states are amplified 
by the circuit. Output states in these cases might still 
prove useful, but a careful analysis of the evolution of 
errors must be conducted. 
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FIG. 10. Dots: States obtainable by recursively using the 
circuit of Fig. [2] with initial resource states \H), \tl)°), 
and 1?/'^). 




FIG. 11. Dots: Rotations implement able using \Hi), ji/'i'), 
1^?) states. 



B. Minimizing the online cost 

In this section, we aim to minimize the online cost, at 
the price of potentially increasing the offline cost. 

The protocol up to this point can be summarized as 
follows. Suppose one wants to implement a Z-rotation 
of an arbitrary angle on a logical state \ip). One has 
to implement a sequence of j rotations {Z{29i^)} on {ip) 
using the sequence of states such that Z{<j)) « 
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TABLE IV. Rotations 
\H,) ,\tP°) states. 
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FIG. 12. Dashed line: Offline (top) and online (bottom) 
costs for the scheme using only \H} states. Dotted lines: Of- 
fline (top) and online (bottom) costs for the scheme using 
{\H) , l^/'o) , |V'o) ) |V'o)} states. The two top curves cross at 
e ^ 1.28 X 10"^ 




(a)Fit: ln(C^,n) = -0.78 + 1.12 ln(ln(l/e)). 
Ln(Coff) 
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(b)Fit: In(C^ff) = 0.54 + 1.75 ln(ln(l/e)). 

FIG. 13. Target accuracies are chosen such that 10~ 
10^'* and the sample size is ~ 1.8 x 10*. The clouds of points 
are used to fit the data according to a linear fit. We obtain 
c ~ 1.12 and c' ~ 1.75, improving on the results of Fig|6] 



Hj Z{2e^^). The online cost is given by 

Consider instead the following protocol to implement 
the same rotation by angle (j). Prepare offline the state 
\Z{(j))) from a copy of |0). To achieve this, use the proto- 
col described in the previous paragraph to rotate |0) to 
\Z{<j))). Note that you can implement it offline because 
you are applying rotations on an ancilla state. Then, use 
\Z{(j))) online to apply the desired rotation. With proba- 
bility i, the rotation Z{(f)) is applied and the online cost 
is 1. If it fails, correct for it by preparing offline the 
state \Z{2(j))). Again, with probability |, the overall ro- 
tation Z{<j)) is applied and the online cost is 2. If it fails, 
prepare offline |Z(40)), and so on. The probability that 
a number n of iterations is required before success de- 
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(a)Fit: (C^'J = 1.99. 
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(b)Fit: In(C^jj) = 1.13-|-1.751n(ln(l/e)). 

FIG. 14. Target accuracies are chosen such that lO"^'^ < 
e < 10~* and the sample size is ~ 1.8 x 10"^. (a) The cloud 
of points is averaged to get the expected number of online 
rotations, (b) The cloud of point is used to perform a linear 
fit. 



creases exponentially with n. This process is a negative 
binomial of parameter P = \ and the expected number 
of online rotations before success goes as ~ - =2. 

We numerically simulated this process for various ran- 
dom angles, <S < 27r, and accuracies, 10~^^ < e < 
10"'*. As Fig. 14 illustrates, we note that there is no 



significant change in the online cost for different values 
of e. Moreover, the expected number of online rotations 
to apply is roughly two, the result one would expect in 
this situation. The scaling of the offline cost is the same 
as that of the previous scheme, as expected. Also, even 
though the scaling is the same, the actual values of the 
offline cost are bigger. The shift is 1.13 — 0.64 = 0.59, see 
Figs. |13(b)| and |14(b)[ which corresponds to a factor of 
gO.59 ^ 1.80. One would expect a slightly bigger shift of 
the offline cost by In 2 since we repeat the scheme twice 
on average. This suggests that there might exist some 
favorable correlations between the expected offline cost 
of an angle and of twice that angle 20. 



IV. ERRONEOUS STATES 

In this section, we determine the effect of errors on 
our resource \H) states, and their effect on the produced 
\Hi) states. A priori, the errors might be amplified by 
the two-qubit circuit of Fig. [2] however we show this is 
not the case. 

The probabilistic and non-homogeneous nature of the 
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FIG. 15. Evolution of the trace distance between imperfect 
Pi and perfect \Hi) states. Circles: data for p = 10~*, 1 < 
i < 28. Squares: data for p = 10~^, 1 < i < 22. Diamonds: 
data for p — 10~*, 1 < i < 16. The full lines are exponential 
decay fits: (2.08 * 10"^) x 2.31"' using points 18 < i < 28, 
(1.63 * 10~^) X 2.28"' using points 18 < i < 22 and (1.26 * 
10"'^) X 2.24"' using points 13 < i < 16 for the circle, square 
and diamond data set, respectively. Sample size is 1000. We 
conclude that if the initial resource state has desired accuracy, 
then this is also true of all derived resource states. 



presented protocol is not well suited for an analytical 
study of the evolution of errors on \Hi) states. Instead, 
we rely on a numerical study for three different types of 
errors. We use the trace distance on states p and tr, 

Dip,a) = hri^ip-a^ip-a)), 

to measure the accuracy of the imperfect \Hi) states. We 
assume Clifford operations are perfect and that errors can 
only occur on the \H) states. 

We consider three types of erroneous states. First, we 
assume that the mixed state, pg, is perfectly along the 
line joining the center of the Bloch sphere and the the 
perfect state, i.e., 

Poip) - {l-p)\Ho){Ho\ +p\- Ho){-Ho\, 

where |— i?o) — sin | |0) — cos ^ |1) is the state orthogonal 
to |_ffo) ■ We denote the imperfect version of \Hi) obtained 
from Pq states as pf. If Clifford operations are perfect, 
we can always bring any mixed state into this form using 
twirling ^2j. However, for the protocol to be of practical 
interest, we require it to remain stable under the two 
following types of errors, where we assume that the state 
is pure, but that the rotation is slightly off of the desired 
axis by S: 



S]X 



COS I 



1 / TT 7T TT 

PniS) — - ( / + sin — cosSX + sin — sin(5F + cos —Z 
^ 2 V 4 4 4 

We numerically generated pseudo-random instances of 
the scheme to produce \Hi) states for different values of 
i and for different noise strengths. We considered 1000 
instances for each of the three types of errors and noise 
strengths 10""*^, 10"^ and 10"^. Figures 15 and 16 show 
that the protocol actually reduces the amplitude of pos- 
sible errors on the resource \H) states, such that if we 




5 10 15 20 25 

(a)Fits: (1.17* 10"^) X 2.31"', 
(1.03* 10"^) X 2.29"' and 
(7.50* 10"*) X 2.25"' 




5 10 15 20 25 

(b)Fits: (8.28* 10"*) X 2.31"', 
(7.32* 10"'=') X 2.30"' and 
(5.30* 10"**) X 2.25"' 

FIG. 16. Distances between the ideal \Hi) states and the 
imperfects states and pi respectively. Sample size is 1000. 
e = 10"", 10"^ 10"* in both cases. 



start with \H) meeting our target accuracy, all the sub- 
sequent derived \Hi) states will also meet it. This even 
suggests that for bigger values of i, one could use noisier 
\H) states and still achieve the desired accuracy. This 
could make a dramatic difference if it enables one to re- 
duce the number of distillation recursions necessary to 
prepare the \H) states. 

We note a very similar behavior for the three types of 
errors. The exponential decay of the distance between 
erroneous and ideal states confirms that the errors are 
well behaved under the proposed protocol. We note that 
the bases for the exponential decay of the errors are com- 
parable, but smaller, than the basis for the exponential 
decay of the angle implemented. So, for a given error 
rate, there exits a point where the angle of rotation im- 
plemented by \Hi) for some i is going to be comparable 
or smaller to the error on that angle. However, this is 
not a problem in practice since for, e.g., e ^ lO"**, we 
find i ^ 150. For this value of i the angle is 9i ^ 10^^^. 



V. COMPARISON TO SOLOVAY-KITAEV 
DECOMPOSITION 

In this section, we compare the performance of the 
Solovay-Kitaev decomposition of [TB] and that of the 
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ln{Coff) = -0.72 + 2.271n{ln(l/.)) + ln3 i„(t;9-^) ^ 0.54 + 1.75 ln(ln(l/e)) + In 3 



FIG. 17. Full line: Cost of Solovay Kitaev decompostion 
(SKD) of (a) random ^-rotations and (b) random unitaries 
as a function of the precision e. Dotted line: Offline costs of 
(a) random Z-rotations and (b) random unitaries. We assume 
that, loosely speaking, a random unitary is the composition of 
three random rotations, hence the additional factor of three. 
Dashed line: Online cost of (a) random Z-rotations and (b) 
random unitaries. (a) For practical values of e, the online cost 
is significantly smaller than SK. The offline cost is lower than 
SK when e < 8.71 x 10"''. (b) A gain, for practical values of 
e, the online cost is significantly smaller than SK. The offline 
cost is lower than SK when e < 2.67 x 10~^. 



FIG. 18. Full line: Cost of Solovay Kitaev decompostion 
(SKD) of (a) random Z-rotations and (b) random unitaries 
as a function of the precision e. Dotted line: Offline costs, us- 
ing {\H) , \ , \ '^^) , initial resources, of (a) random 
Z-rotations and (b) random unitaries. Dashed line: Online 
cost, using {l-ff) , IV'") ) IV'"'") I i^^)} 8.S initial resources, of (a) 
random Z-rotations and (b) random unitaries. (a) For prac- 
tical values of e, the online cost is significantly smaller than 
SK. The offline cost is lower than SK when e < 4.41 x 10"". 
(b) Again, for practical values of e, the online cost is signif- 
icantly smaller than SK. The offline cost is lower than SK 
when e < 1.03 x lO"'^. 



schemes presented in this article. In order to do this, 
we first consider different Z-rotations of angles 7r/16, 
7r/128, and 7r/1024 and different accuracies 10""*, 10"^ 
and 10-12 

Table |V] lists the expected costs. Con and CoS are 
respectively the online and offline costs to implement 
the gate using only states. C'^^ and C^jj refer 
to the costs for the version of the scheme thats uses 
{\H) , \ipQ) , I^/iq) , \ipo)} as initial resource states. Csk 
refers to the extrapolated cost to implement these gates 
using the results from |TB]. This extrapolated cost aver- 
ages over all unitaries (c ~ 3.40). This is optimistic since 
the results of [16] suggest that Z-rotations are actually 
harder to implement (c ^ 4.3). Note that in theory, the 
cost of this algorithm is 0(log^(l/e), where c = 3.97 [H] , 



In all cases, the online cost is minimal when our pro- 
posed scheme enhanced by {|V'o) j [''/'o) ' IV-'o)} is used. 
For rougher precision, e.g., 10"^, the offline cost might 
be such that the total cost is still minimal for the Solovay- 
Kitaev implementation of [16j . For finer precision, e.g., 
IQ-^ or IQ-i^, our proposed protocol becomes very ad- 
vantageous, since the cost of the Solovay-Kitaev decom- 
position becomes prohibitive. 

We also compare the average behavior of the different 
schemes as Figs. [17] and [18] illustrate. We first start by 
noting that, loosely speaking, a random unitary is com- 
posed of three random rotations, such that the curves 
presented previously in Fig. |12| must be shifted by In 3 in 
general. Fig. [17] plots the fit for the Solovay-Kitaev de- 
composition (solid line), the online cost (dashed) and of- 
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c 


e = 10 * 


e = 10 


e = 10 


7r/16 


CsK 


43.83 


2646 


29120 




Con 


10.20 


24.52 


41.95 




Con 


5.88 


12.48 


19.38 




Coff 


73.06 


349.8 


874.4 




CofS 


98.29 


306.1 


595.0 


7r/128 


CsK 


53.84 


2879 


29530 




Con 


5.47 


18.96 


39.27 




Con 


3.32 


9.27 


16.91 




Coff 


49.18 


313.0 


923.9 




Coff 


52.60 


234.1 


560.8 


7r/1024 


CsK 


128.1 


2594 


15075 




Con 


7.99 


23.08 


42.93 




Con 


3.00 


8.37 


15.23 




Coff 


77.42 


381.3 


969.1 




Coff 


65.75 


245.5 


530.7 



TABLE V. Con and Coff are respectively the online and offline 
costs to implement the Z-rotation by angle 9 using only |_ff) 
states, to precision e. Con and C^ff refer to the costs for the 
version of the scheme thats uses {|-ff) , | V'o) > j'Ao) > |^o)} as 
initial resource states. Csk refers to the extrapolated cost to 
implement these gates using the results from [16] . 



fline cost (dotted). For all practical accuracies, the online 
cost of our proposed scheme is consistently the smallest. 
However, the offline cost becomes advantageous when 
e < 8.71 X lO"** for Z-rotations and e < 2.67 x 10"'^ for 
random unitaries. Fig. [T8| plots the same for the scheme 
with additional initial resource states. Similarly, the of- 
fflne cost becomes advantageous when e < 4.41 x 10^* 
for Z-rotations and e < 1.03 x 10"^ for random unitaries. 



VI. CONCLUSION 

We have proposed an alternative protocol to Solovay- 
Kitaev decomposition that results in significantly smaller 
resource costs, in both the number of required resource 
states and the depth of the circuit. We have shown a 
significant improvement on average in the value of c, 
and in many cases the number of distilled states and 
rotations required to implement a single-qubit unitary 
gate are reduced. Another advantage of our protocol is 
that the number of resources required is a "smoother" 
function of accuracy, whereas Solovay-Kitaev decompo- 
sition is step-like in nature because of the recursion pro- 
cess used in practice. However, note that our protocols 
and Solovay-Kitaev decomposition are not exclusive. It 
might be that some unitaries are better implemented us- 
ing Solovay-Kitaev decomposition, while our scheme is 
better suited for Z-rotations, which occur, among other 
algorithms, in the quantum Fourier transform. 

As future research, there are likely a variety of other 
circuits that enable other "ladders" of states. One natu- 
ral extension would be to use the SH eigenstates distilled 
using the protocols of [iTJ [13] . Another extension would 
be to perform a systematic study of "small" Clifford cir- 
cuits. Finally, we note that implementing a rotation by 
choosing the state which results in an angle closest to the 
target angle is a simple way of achieving our goal, but it 
is surely suboptimal. An important research direction 
would be to optimize the sequence of angles required to 
implement the desired rotation. 
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